Improve fsspec DataPipe to accept extra keyword arguments #495
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
    else:
        abs_path = file_name
Just to confirm - this never happens if `is_local == True`?
Good question. I can't guarantee it since I am not aware of all use cases. Based on my experience with `fsspec`, I only encounter the problem when the input is an `s3` URL.

For local files, I would trust the test here https://github.com/pytorch/data/blob/main/test/test_fsspec.py#L56-L65. As long as it doesn't break, I think
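
The path-joining concern in this thread can be illustrated with a minimal sketch. It assumes a listing may return entries that already contain the root path, as reported above for `s3` URLs, and joins only when that prefix is missing. `resolve_listed_path` is a hypothetical helper written for illustration; it is not the code from this PR.

```python
import posixpath


def resolve_listed_path(root_path: str, file_name: str) -> str:
    """Return a full path for one entry of a directory listing.

    Hypothetical helper: joins `file_name` onto `root_path` only when the
    entry does not already start with the root path.
    """
    root = root_path.rstrip("/")
    # Entry already carries the root prefix: return it unchanged.
    if file_name == root or file_name.startswith(root + "/"):
        return file_name
    # Entry is relative to the root: join with forward slashes.
    return posixpath.join(root, file_name)
```

Checking for `root + "/"` rather than the bare root is a design choice in this sketch, so that a sibling directory such as `bucket/database` is not mistaken for an entry under `bucket/data`.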
Summary: Fixes pytorch#494

### Changes

- Add `kwargs` to all `DataPipe`s using `fsspec`
- Fix a bug in `FSSpecFileLister` to prevent joining paths from the root path if the root path is already present in the paths.
- Add/Fix more tests

I need this PR to test the performance between `fsspec` and native `s3`

Pull Request resolved: pytorch#495

Reviewed By: NivekT

Differential Revision: D36908570

Pulled By: ejguan

fbshipit-source-id: 93da4c1c7e18012fb053799265375124d28a856c
Fixes #494

### Changes

- Add `kwargs` to all `DataPipe`s using `fsspec`
- Fix a bug in `FSSpecFileLister` to prevent joining paths from the root path if the root path is already present in the paths.
- Add/Fix more tests

I need this PR to test the performance between `fsspec` and native `s3`.
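
The idea of accepting extra keyword arguments can be sketched as follows. This is an illustrative pattern only: `RecordingFS` and `ListerSketch` are hypothetical names, and the stub filesystem stands in for a real one so the example runs without `fsspec` installed.

```python
from typing import Any, Callable, Dict, Iterator, List


class RecordingFS:
    """Hypothetical stand-in filesystem that records the options it was built with."""

    def __init__(self, **options: Any) -> None:
        self.options: Dict[str, Any] = options

    def ls(self, path: str) -> List[str]:
        # A real filesystem would list `path`; this stub returns fixed entries.
        return [f"{path}/a.csv", f"{path}/b.csv"]


class ListerSketch:
    """Hypothetical lister that forwards extra keyword arguments to its filesystem factory."""

    def __init__(
        self, root: str, fs_factory: Callable[..., Any] = RecordingFS, **kwargs: Any
    ) -> None:
        self.root = root
        self.fs_factory = fs_factory
        self.kwargs = kwargs  # stored untouched and passed through on iteration
        self.last_fs: Any = None

    def __iter__(self) -> Iterator[str]:
        # Build the filesystem lazily, using the stored options.
        fs = self.fs_factory(**self.kwargs)
        self.last_fs = fs
        yield from fs.ls(self.root)
```

For example, `ListerSketch("bucket/data", anon=True)` passes `anon=True` through to the factory. In `fsspec` itself, `fsspec.core.url_to_fs(url, **kwargs)` accepts such filesystem options; whether the DataPipes in this PR use that exact call is not shown in this thread.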